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Abstract 

o 

The pumping lemma for context-free languages is a result about push- 
I down automata whicii is strikingly similar to the well-known pumping 

^ lemma for regular languages. However, though the lemma for regular 

languages is simply proved by using the pigeonhole principle on deter- 
^SJ ministic automata, the lemma for pushdown automata is proven through 

7—i an equivalence with context-free languages and through the more pow- 

erful Ogden's lemma. We present here a proof of the pumping lemma 
' ' for context-free languages which relies on pushdown automata instead of 

context-free grammars. 

^ 1 Setting 

T-H The pumping lemma for regular languages is the following well-known result: 

> 

^\ Theorem 1. Let L be a regular language over an alphabet E. There exists 

some integer p > 1 such that, for every w ^ L such that \w\ > p, there exists a 
decomposition w — xyz such that: 

1- \xy\ < p 

^ 2- \y\ > 1 



X 



3. Vn > 0, xy^-z e L 

The pumping lemma for context-free languages |BHPS6l"| , also known as the 
Bar-Hillel lemma, is the following similar result: 

Theorem 2. Let L be a context-free language over an alphabet S. There exists 
some integer p > 1 such that, for every w d L such that \w\ > p, there exists a 
decomposition w = uvxyz such that: 

1. \vxy\ < p 

2. \vy\ > 1 

3. Vn > 0, uv'^xy'^z e L 
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One would expect the classical proofs of these results to be similar. However, 
this is not the case. The pumping lemma for regular languages [HU79] is usually 
proved through the equivalence between regular languages and finite automata 
by picking a deterministic automaton A which recognizes the language L ; we 
can then use the fact that the accepting path of any word w longer than the 
number of states of A must pass by the same state twice (by the pigeonhole 
principle), yielding the points at which we can decompose w. The pumping 
lemma for context-free languages, however, is usually derived from Ogden's 
lemma |Ogd68| which is itself proved by examining context-free grammars and 
not pushdown automata (using the equivalence of these two formalisms). 

It seems reasonable to hope that the pumping lemma for context-free lan- 
guages can be proved directly from the properties of pushdown automata, with 
no reference to context-free grammars. In the next section, we propose such a 
proof. Though the underlying ideas that we introduce in this proof are appar- 
ently part of the folklore, we are not aware of existing published work on this 
topic. 

2 Proof 

Let i be a context-free language over an alphabet E. Let ^ be a pushdown 
automaton which recognizes L, with stack alphabet F. We denote by \A\ the 
number of states of A. To simplify the reasoning, we will impose the following 
condition on A: all transitions of A pop the topmost symbol of the stack and 
either push no symbol on the stack or push on the stack the previous topmost 
symbol and some other symbol. It is easy to see that any pushdown automata 
which pushes arbitrary sequences of symbols on the stack can be rewritten in 
this fashion by replacing its transitions by an initial pop transition followed by 
a sequence of e-transitions pushing the appropriate symbols on the stack. 

We define p' = \A\'^\T,\ and define the pumping length to bep = |A|(|r|-M)p'. 
We will now show that all w £ i such that |w| > p have a decomposition of the 
form w — uvxyz such that \vxy\ < p, \vy\ > 1 and Vn > 0, uv'^xy^z E L. 

Let w E L such that \w\ > p. Let tt be an accepting path of minimal length 
for w (represented as a sequence of transitions of A), we denote its length by 
|7r|. We can define, for < i < |7r|, the size of the stack at position i of the 
accepting path. For all iV > 0, we will define an iV-level over tt as a set of 
three indices k with 0<i<j<k<p such that the stack grows by N 
symbols between i and j and shrinks by N symbols between j and k. Formally, 
we require that: 

1. = Sk,Sj = s., + N 

2. for all n such that i<n<j,Si<Sn< Sj 

3. for all n such that j < n < k, Sk < Sn < Sk- 

We define the level Z of tt as the maximal iV such that tt has an A^-level. 
This definition is motivated by the following observation: if the size of the stack 
over a path tt becomes larger than its level I, then the stack symbols more than 
I levels deep will never be popped. Formally, we define the configurations of 
A as the couples of a state of A and a sequence of I stack symbols (where stacks 
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Figure 1: Illustration of the construction for case 2. To simplify the drawing, 
the distinction between the path positions and word positions are omitted. 



of size less than / with be represented by padding them to / with a special blank 
symbol, which is why we use |r| + 1 when defining p). By definition, there 
are |A|(|r| + 1)' such configurations. Essentially, A acts as a finite automaton 
without stack between the configurations. 

We can now distinguish two cases: either the level is low and the number of 
configurations is small, or the level is high. Formally: 

1. I < p' and, by the pigeonhole principle, the same configuration is encoun- 
tered twice in the first p + 1 steps of tt, 

2. I > p' and, by the pigeonhole principle, we will prove that a certain notion 
of full state is repeated for two different stack sizes in any /-level of w. 

Case 1. I < p' . In this case, the number of configurations is less than p. 
Hence, in the p -I- 1 first steps of tt, the same configuration is encountered twice 
at two different positions, say i < j. Denote by i (resp. j) the position of the 
last letter of w read at step i (resp. j) of tt. We have i < j. Hence, we can 
factor w = uvxyz with yz ^ e, u = Wq...-~, v = x = wj,„^^^. (By Wx...y 

we denote the letters of w from x inclusive to y exclusive.) By construction, 
\vxy\ < p. 

We also have to show that Vn > 0, uv"xy"z = uv^x e L, but this follows 
from our observation above: stack symbols deeper than I are never popped, so 
there is no way to distinguish configurations which are equal according to our 
definition, and an accepting path for uv"x is built from that of w by repeating 
the steps between i and j, n times. 

Finally, we also have \v\ > 0, because if v = e, then, because we have the 
same configuration at steps i and j in tt, tt' = ttq.. .iiTj...^.^^ would be an accepting 
path for w, contradicting the minimality of tt. 

Case 2. I > p' . Let i,j,k be a p'-level. To any stack size h, Si < h < sj, 
we associate the last push \p{h) — max({y < j\sy = h}) and the first pop 
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fp(/i) = min({t/ > j\sy = h}). By definition, i < \p{h) < j and j < fp(/i) < k. 
We say that the full state of a stack size h is the triple formed by: 

1. the automaton state at position lp{h) 

2. the topmost stack symbol at position lp(/i) 

3. the automaton state at position fp(ft.) 

There are p' possible full states, and p' + 1 stack sizes between Si and Sj , so, 
by the pigeonhole principle, there exist two stack sizes g, h with Si < g < h < Sj 
such_that the full_states at ^and h are the same. Like in Case 1, we define 
by Ip (g), Ip (h), fp (h) and fp (g) the positions of the last letters of w read at 
the corresponding positions in tt. We factor w = uvxyz where u \p(gy 

This factorization ensures that \vxy\ < p (because k < phy our definition of 
levels). 

We also have to show that Vn > 0,uv^xy"z G L. To do so, observe that 
each time that we repeat v, we start from the same state and the same stack 
top and we do not pop below our current position in the stack (otherwise we 
would have to push again at the current position, violating the maximality of 
Ip(S'))) so we can follow the same path in A and push the same symbol sequence 
on the stack. By the maximality of lp(/i) and the minimality of fp(/i), while 
reading x, we do not pop below our current position in the stack, so the path 
foIlow(xl in the automaton is the same regardless of the number of times we 
repeated v. Now, if we repeat w as many times as we repeat v, since we start 
from the same state, since we have pushed the same symbol sequence on the 
stack with our repeats of v, and since we do not pop more than what v has 
stacked by minimality of ip{g), we can follow the same path in A and pop the 
same symbol sequence from the stack. Hence, an accepting path from uv^xy^z 
can be constructed from the accepting path for w. 

Finally, we also have \vy\ > 1, because like in case 1, ii v = e and y = e, 
we can build a shorter accepting path for w by removing TTip(g)...ip(h) and 

7rfp(h)...fp(g). 

Hence, we have an adequate factorization in both cases, and the result is 
proved. 
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